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BLOCK-APPENDED CHECKSUMS 



Background of the Invention 

2. Field of Invention 

This invention relates to data storage systems. 
2. Related Art 

Many computer applications need to store and retiieve information. 
Information can be stored on hard disks, floppy disks, CD-ROMs, semiconductor 
RAM memory and similar storage devices. Many of these storage systems are 
susceptiT)le to data loss of various forms including disk failures. A solution to the 
problem of disk failure involves use of a RAID (redundant array of independent 
disks) system. One style of RAID systems uses multiple hard drives to store parity 
data generated from tiie data drives, eitiier on a separate drive (known as the parity 
disk) or spread out among the multiple drives. The use of multiple hard drives makes 
it possible to replace faulty hard drives without going off-line; data contained on a 
drive can be rebuilt using the other data disks and the redundant data contained in the 
parity disk. If a hard drive fails, a new hard drive can be inserted by "hot-swapping" 
: drives while on-line. The RAID system can rebuild the data on the new disk using 
the remaining data disks and the redundant data of the parity disk. The performance 
of a RAID system is improved by disk stiiping, which interleaves bytes or groups of 
bytes across multiple drives, so more than one disk is reading and writing 
simultaneously. Files are broken into chunks of data known as file blocks and these 
file blocks are stored in one or more physical sectors of one or more hard disks. Each • 
file block is a given size such as 4,096-bytes that takes up 8 sectors. 

A first known problem with storage devices is that they are susceptible to data 
corruption. This data corruption includes bit flips, misdirected I/O, lost I/O, sector 
shifts, and block shifts. One style of RAID uses parity data to determine whether 
there has been corruption of some data included in a disk sfaipe. Parity is checked by 
comparing the parity value stored on disk against the parity values computed in 

1 
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memory. Parity is computed by taking the exclusive-OR (henceforth "XOR") of the 
blocks in the data stripe. If the stored and computed values of parity are not the 
same, the data may be corrupt. If a single disk block is incorrect, the RAID systmi 
includes enough data to restore flie corrupted block by recalculating the corrupted 
data using the parity data and the remaining data in the data stripe. However, such 
RAID systems can not determine which disk includes the comipt data from parity 
values alone. Although parity data is useful in determining whether corruption has 
occiirred, it does not include enough information to restore the corrupted data. 
Moreover, it is unclear which data has been corrupted. 

Checksums are another form of redimdant data that can be written to 
individual disks. The combination of parity bits across the disks along with 
checksums and their associated information may include enough information so that 
the cormpted data can be restored in RAID and other redundant systems. 



15 



A second known problem involves using a sector checksum for each sector of 
data. A sector checksum is generated for each collection of data that can fill a sector. 
The data is stored in a disk sector, along wifli the associated sector checksum. Some 
known systems include reformatting a collection of hard disks from standard sector 

20 sizes such as 512-byte sectors to include sector checksimis in each sector such as 

refonnattiiig to 520-byte sectors. Data corruption in the disk sector can then be 
detected by using the sector checksum because the stored checksum would not match 
a computed checksxmi. However, data corruption such as sector slides, misdirected 
reads, and writes, and lost sectors would not be detected at the disk sector level. For 

25 this type of corruption, a checksum computed from the sector data would match flie 

stored checksum. 

A third known problem is storing checksums in reserved locations separate 
from the associated data. A separate read or write operation of the checksum is 
30 reqiiired for every read or write opemtion of the associated data. This can result in 

performance loss in some workloads. 

Accordingly, it would be advantageoxis to provide an improved technique for 
th^ error checking and correction of data storage systems. This is achieved in an 

2 
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embodiment of the invention lhat is not subject to the drawbacks of the related art. 



Summary of the Invention 

The invention provides an improved method and apparatus for a reliable data 
storage system using block level checksums appended to data blocks. 

In a first aspect of the invention, a block-appended checksum is CTeated at the 
filesystem block level, where a block is the filesystem's unit of transfer. In a preferred 
embodiment, the data storage system is a RAID system composed of multiple hard disk 
drives, including a parity disk drive and a controller for the drives. Files are stored on hard 
disks in storage blocks, including data blocks and block-^p^ded checksums. The block- 
appended checksum includes a checksum of flie data block, a Virtual Block Number (VBN), 
a Disk Block Number (DBN), and an embedded checksum for checking the integrity of the 
block-appended checksum itself. The block-appended checksum reliably detects conuption 
of data within a sector such as bit flips, as does a sector checksum. However, a block- 
appended checksum also reliably detects data corruption across sectors including 'sector 
slides, misdirected reads and writes, and lost sector I/O. 

The combination of (1) parity bits across the RAID system stored on the 
parity disk, (2) the remaining uncorrupted data in the data disks, and (3) block-app^ded 
checksum within each disk includes sufficient information so as to enable detection and 
restoration of corrupt data in RAID systems and other similar devices. Such a combination 
is preferable to using block-appended checksums alone because block-appended checksums 
are limited to detecting OTcors. 

In a second aspect of the invention, a file system includes file blocks with 
associated block-appended checksum to the data blocks. The file blocks with block- 
appended checksums are written to storage blocks. In a preferred embodiment a collection 
of disk drives are formatted with 520 bytes of data per sector instead of the more commonly 
found 512 bytes per sector. For each 4,096-byte file block, a corresponding 64-byte block- 
appended checksum is appended to the file block. When this is written to disk, the first 7 
sectors includes most of the file block data while the 8*^ sector includes the remaining file 
block data and the 64-byte block-appended checksum for a total of 4,160-bytes of data. 

3 
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Because the block-appended checksums are appended to the file blocks, every read or write 
to a storage block includes the x^i„g or ^vriting of the file block and the block-appended 
checksum in a single operation, m known cases, this results in greatly improved 
pCTformance. 

5 

fa a third aspect of the invention, I/O operations are first stored in NVRAM 
(non-volatile random access memory). Jn the event of a system crash. I/O operations are 
relayed from NVRAH which preserves file block data. When the I/O operation is 
performed again, the corresponding block^appended checksum information is simply 
10 recalculated. 

In a preferred embodiment, the inv«ition is operative on a RAID level 4 
system for a file server. However, in other embodiments, the invention is applicable to any 
^ computer data storage system such as a database system or a store and forward system such 
15 as cache or RAM. 



Brief Descr iption of the Drawinff g 

Figure 1 shows a block diagram of a reliable, redundant data storage system 
20 including block-appaided checksum. 

Figure 2 shows a flow diagram of a method for writing file blocks with block- 
appended checksums to a reliable, redundant data storage system. 



25 



30 



Figure 3 shows a flow diagram of a method for reading data from data storage 
blocks including file blocks with block-appended checksums in a reliable, redundant data 
storage system. 

I ■ • 

Detailed Description of th^ Preferred F.mWi^^. . 

In the following description, a preferred embodiment of the invention is 
descnbed with regard to preferred process steps and stiuctiires. TTiose skilled in the art 
would recognize after penisal of this application that embodiments of the invention can be 
unplemented using elements adapted to particular process steps and structures described 

4 
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herein, and that implementation of the process steps and structures described herein would 
not require undue experimentation or further invention. 

Incorporated Disclosures 

The inventions described herein can be used in conjunction with inventions 
described in the following applications: 

• International Application No. PCT/USOl/25901, entitled"Manipulation of Zombie 
Files and Evil-Twin Files," in the name of Network AppUance, Inc., ffled on 17 
August 2001, with a priority date of 18 August 2000 

• International Application No. PCT/USOl/25822, entitled "Improved Space 
Allocation in a Write Anywhere File System," in the name of Network Appliance, 
Inc., filed on 17 August 2001 , with a priority date of 1 8 August 2000 

• International Application No. PCT/USOl/25763, entiUed "Instant Snapshop," in the 
name of Network Appliance, Inc., filed on 17 August 2001, with a priority date of 18 
August 2000 

Lexicography 

As used herein, use of the following terms refer or relate to aspects of the 
invention as described below. The general meaning of these terms is intended to be iUusory 
and in no way limiting. 

• Sector - In general, the term "sector" refers to a physical section of a disk drive 
including a collection of bytes, such as 512 or 520 bytes. This is the disk drive's 
minimal unit of transfer. 

• Storage block - In general, the term "storage block" refers to a group of sectors, 
such as 8 sectors or 4,096 bytes for 512 byte sectors, or 4,160 bytes for 520 byte 
sectors. 
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Data block - In general, the tenn "data block" refers to a collection of bytes stored 
in a storage block, such as 4,096 bytes with 512-byte sectors or 4,1 60 bytes with 520- 
byte sectors each stored in 8 sectors. 

Block-appended checksum - Jn general, the tenn "block-appended checksum- 
refers to a coUection of bytes, such as 64 bytes, which may include a checksum of a 
data block, a Virtual Block Number (VBN), a Disk Block Number (DBN) and an 
embedded checksum for checking the integrity of checksum information itself. 

Stripe - M general, the tenn "stripe" refers to the collection of blocks in a volume 
with the same DBN on each disk. 

V.Id,« - b ge„^ ae tenn "volme" rrfte to . single file system spread aemss 
mulflple disto and.a«>cia.«J.d«k driv«. Known datt storage syaems have current 
su» Inmts, such as greater than terabyte and are included in multiple volumes. 

DBN (Disk Block Number) - In general, the tem, "DBN" refers to a location of a 
particular block on a disk in a volume of a file system 

VBN (Volume Block Number) - In general, the tenn "VBN" refe to an integer 
that maps to a disk number and disk block number. 

WAFL (Write Any^U^ HI. Layout) _ general, ahigh level stnuiur. for a file 
system that ia above RAID in Weraniy and includes m«ada.a, such as one or more 
copies of the block" (file system inf«mation block) located at fixed locations 

on d«k. Poimers are used for loc«ing flre remaining data. All the data except the 
6mfo blocks are coUected into files ahd these files can be written anywh«e on the 
disk. 



Parity checktog - to general, the tem. '^adty checking" refers to an enx.r detection 
tahmque that tests the integrity of digital data within a computer system or over a 
network. Parity bits are checked by comparing them against computed values of 
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parity, which are the XOR of the sets of data bits. 



• Parity disk - In general, the term "parity disk" refers to a separate disk drive that 
holds parity bits in a disk array, such as four data disks and one parity disk in a 

5 volume of a data storage system. 

• Parity protected - In general, the term "parity protected" refers to protection of a 
collection of data using parity bits. Data is parity protected if it has parity for an 
entire collection of data. In a preferred embodiment, parity computations can be 

10 made across bytes. 

• Checksum - In general, the terai "checksum" refers to a value used to ensure data is 
stored or transmitted without error. This value is created by calculating the binary 
values in a block of data using some algorithm and storing flie results with the data or 

15 at a separate location. When the data is retrieved from memory, received at tHe other 

end of a network, or retrieved from a computer storage system, a new checksum is 
computed and matched against the existing checksum. A mismatch indicates an 
error. 

20 As described herein, the scope and spirit of the invention is not limited to any 

of the definitions or specific examples shown therein, but is intended to include the most 
general concepts embodied by these and other terms. 

System Elements 

25 

Figure 1 shows a block diagram of a reliable, redundant data storage system 
including block-appended checksums. 

A data storage system 100 includes a controller CPU (central processing unit) 
30 105, an I/O port 1 10, a file system 1 15, a RAID system 125, a disk driver 135, a host/disk 
adapter 145, a hard disk collection 150, including drive 155, drive 160, drive 165, drive 170 
and parity drive 175. 
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A data storage system 100 is part of a larger computer system. The I/O port 
1 10 is comiected to the larger computer system in such a way that the controller CPU 105 
can send data to and from the I/O port 1 1 0. TTie data is written to and read from the hard 
disk collection 150, including a parity disk 175 in a data storage system 100. 

Unlike other parity systems that may require breaking up the bytes in a block 
of data or breaking up the block of data itself, each bit in the parity block is computed using 
the corresponding bits in the data blocks. Thus, if there are four blocks of data, one block 
would be put on a first drive 155, the second block would be put on drive 160, the third 
block would be put on drive 165 and the fourth block on drive 170. The parit^ block is 
computed using an XOR of the data blocks. 

hi a preferred embodiment, the five disk drives 155, 160, 165, 170 and 1 75 in 
a RAID syst«n 125 include one or more volumes. A volume is a sin^e file system in a data 
storage system. Each blodc has a unique VBN (volume block numba) and DBN (disk block 
number). The VBN specifies the location of a block in a volume. The DBN specifies the 
locationof ablock in a disk: Therefore, more than one block can have the same DBN if they 
are in the same location on different disks. However, only one block can have a given VBN. 

Known data storage systems format hard disks with 512-bytes per sectors. 
Prior art systems with checksums may include disks formatted with 520-byte sfectoi^ 
comprising 512-bytes of file block data and 8-bytes for a sector checksum. In a preferred 
embodiment, each disk in a hard disk collection 150 is fonnatted with 520-bytes per sector. 
Files are broken into fixed sizes of data known as file blocks. These file blocks are stored in 
one or more physical sectors of one or more hard disks such as 4.096-bytes that take up 8 
sectors. With a hard disk formatted to 512-byte sectors, the file block fits into 8 sectors with 
no extra bytes remaining. With a hard disk fonnatted for 520-bytes per sector, the 4,096- 
byte file block fits into the 8 sectors with 64 bytes free for a block-appended diecksum.' TTie 
first 7 sectors contain only file block data while the 8* sector includes the remaining file 
block data and ends with the 64-byte block-appaided checksum. This 520-bytes per sector 
formatting approach allows the file block and checksum to be written or read in a single 
operation. The resulting block-appended checksum has an advantage over the prior art 
sector checksums in a 520-byte formatted hard disk because it can reUably detect sector data 
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reads and writes, lost sectors and similar 



In a preferred embodiment, a series of software and hardware layers is 
required for reading and writing data between the CPU 105 and the hard disk collection 1 50. 
A file system 1 15 takes a relatively large data file and divides it into a group of file blocks of 
a given size such as 4,096-bytes. A RAID system stripes these file blocks across a collection 
of hard disks such as a hard disk collection 150 including four data disks, disk 1 155, disk 2 
160, disk 3 165, and disk 4 170 plus a parity disk 175 that provides redundancy for the data 
High performance is accomplished using the RAJD system by breaking the group of file 
blocks into four sub groups and stripmg these sub groups of blocks in parallel across the data 
disks. Each file block in a RAID system receives a block-appended checksum of a given 
size such as 64-bytes, The block-upended checksum is upended to the file block to 
produce a file block with a block-appended checksum of 4,160-bytes. The block-appended 
checksum information mcludes at least: a 4-byte checksum of the data block; a#irtual Block 
Number (VBN); a Disk Block Number (DBN); and a 4-byte embedded checksum for 
checking the integrity of the block appended checksum itself. Other embodiments may use 
other formats of data and algorithms other than Adler's. A sector checksum and a block- 
appended checksum reliably detect bit flips. However, only a block-appended checksum 
reliably detects sector data corruption including sector slides, misdirected reads and writes, 
and lost sectors. 

y 

In a preferred embodiment, the file system 1 15 allocates a collection of 4,096- 
byte buffers for each file block when writing a stripe of blocks to the hard disk collection 
150. Each file blodc has the same DBN in a stripe provided the hard disk collection 150 is 
composed of equal sizes of hard disks. Each 4,096-byte file block 120 is written to a 4,096- 
byte buffer and sent to RAID 125. In RAID 125, each 4,096-byte buffer is appended with 
64-bytes for a total block size of 4,160 bytes to accommodate the 64-byte block-appended 
checksum 130. The I/O operations are logged to NVRAM. If the system crashed after this 
point, the file blocks can be restored upon recovery fi-om the crash by replaying the log of 
I/O operations from NVRAM. Each 4,096-byte file block plus 64-byte checksum 140 is sent 
to the disk driver 135. The disk driver 135 creates a scatter/gather list that provides 
instructions where host/disk adapter 145 should distiibute each file block plus 64-byte 
checksum 140. The collection of buffers and the scatter/gaflier Hst are sent to the host/di^ 

9 
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adapter 145. The host/disk adapter 145 then writes the stripe of file blocks with the block- 
appended checksums to the hard disk collection 150 including the four hard disks, disk 1 
155, disk 2 160, disk 3 165, disk 4 170. The parity data is created from the stripe of file 
blocks and it is writt^i onto the parity disk 1 75. A file block with block-appended checksum 
5 180 is written to a storage block on disk 1 155 that is composed of 8 sectors of the disk. 
There is a single operation for writing the file block with appended checksum. The file 
block data fills all 8 sectors with space remaining in the last part of the last sector to hold the 
block-appended checksum. Wh^ a file is read, each file block and associated block- 
appended checksum is also done as a single operation. The stored block-appended 
10 checksum is compared with a computed block-appended checksum to validate the data. If 
the stored and computed block-appended checksums are not equivalent, the data has been 
corrupted and must be rebuilt using the remaining hard disks including the parity disk 175 in 
the hard disk collection 150. 

15 Method of Use 

Figure 2 shows a flow diagram of a method for writing file blocks wifli block- 
appended checksimis to a reliable, redundant data storage system. 

20 A method 200 is performed by the data storage system 100* Although the method 

200 is described serially, the steps of the method 200 can be performed by separate elements 
in conjunction or in parallel, whether asynchronously, in a pipelined manner, or otherwise. 
There is no particular requirement that the method 200 be performed in the same ord^ in 
which this description lists the steps, except where so indicated. . , 

25 

At a flow point 205, the data storage system 100 is ready to pofonn the method 200 
to a file system 115 including writing file blocks and block-appended checksums. In the 
prefeired embodiment the write mefliod 200 requires fonnattmg hard disks to 520 byte 
sectors. 

30 

At a step 210, the data storage system 100 receives a request firom the us€ar to write a 
file block to the file system 115. 



At a step 215, the data storage system 100 allocates and fills a 4,096-byte 

10 
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buffer with a file block. 

At a step 220, the data storage sjretem 100 sends the filled 4,096-byte buffer 

to RAID 125. 

At a step 225, the data storage system 1 00 allocates a 64-byte bufifer in RAID 

125. 

At a flow point 230, the data storage system 100 computes a block-appended 
checksum for the 4,096-byte file block in the 4,096-byte buffer, fills the 64-byte buffer with 
the block-upended checksum and app^ds the 64-byte buffer to the 4,096-byte buffer. 

At step point 235, tiie data storage system 1 00 sends the 4,096-byte file block 
buffer including the 64-byte block-app^ded checksum buffer to the disk driver 135. 

At a step point 240, the data storage system 100 creates a scatter/gather list 
using the disk driver 135 to distribute the 4,096-byte file block with appended checksum to a 
group of sectors making up a storage block on one or more of the disks in the hard disk 
collection 150. 

At a step 245, the data storage system 100 sends the 4,096-byte 'buffer, 
including the appended 64-byte buffer and the scatter/gather list to the host/disk adapter 145. 

At a step 250, the data storage system 100 writes the file block with the 
block-app^ded chedcsum to a storage block in a single op^tion. 

At a step 255, the data storage system 100 completes writing to one or more 
of the hard disks in the hard disk collection 150. 

At a step 260, the data storage system 100 frees up the 64-byte buffer in 

RAID 125. 



At a flow point 265, the data storage system 100 has succeeded or failed at 
writing a file to the file system. 

11 
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Figure 3 shows a flow diagram of a method for reading data from data storage blocks 
including file blocks with block-app«ided checksums in a reliable, redundant data storage 
system. 

5 

A read method 300 is performed by the data storage system 100. Although the read 
method 300 is described serially, the steps of the read mefliod 300 can be performed by 
separate elements in conjunction or in parallel, whether asynchronously, in a pipelined 
manner, or otherwise. There is no particular requirement that the read method 300 be 
10 performed in the same order, in which this description lists the steps, except wh»e so 
indicated. 

At a flow point 305, flie data storage system 100 is ready for requests to read file 
blocks from a file system 1 15, including reading file blocks and block-appended checksums. 

15 

At a step 310, the data storage system 100 receives a request from the user to read a 
file block to die file system. 115. 

At a step 315, the dsda storage system 100 allocates a 4,096-byte buffer for a 

20 file block. 



At a step 320, the data storage system 100 sends the empty 4,096-byte buffer 

to RAID 125. 



25 



30 



At a step 325, the data storage system 100 allocates a 64-byte buffer in RAID 
1 25 and appends it to the 4,096-byte buffer. 

At a flow point 330, the data storage system 100 sends the 4i096-byte file 
block buffer with the 64-byte block-appended checksum buffer to the disk driver 135. 

At a step point 335, the data storage system 1 00 creates a scatter/gather list in 
the disk driver 135 to collect the 4,096-byte file block from a group of sectors making up a 
storage blodc on one or more of flie disks in the hard disk collection 1 50. 



12 



wo 03/009286 PCT/USOl/51321 

At a Step 340, the data storage system 100 sends the 4,096-byte buffer with 
the appended 64-byte buffer along with the scatter/gather list to tiie host/disk adapter 145. 

At a step 345, the data storage system 1 GO reads the file block with the block- 
5 appended checksum from a storage block in a single operation. 

At a step 350, the data storage system 100 completes reading the file block 
with the block-appended checksum from one or more of the hard disks collection 150. 



10 



At a step 355, the data storage system 100 computes the block-appended 
diecksum for the file block and compares it to the appended block-appended checksum to 
verify the file block. 



At a step 360, the data storage S3^em 100 frees up the 64-byte buffer in 

15 RAID 125. 

At a flow pomt 365, the data storage system 100 has succeeded or failed at 
reading a file block from the file system 115. 

20 Alternative Embodiments 

Although preferred embodiments are disclosed herein, many 
variations are possible which ronain within the concept scope, and ^irit of Ae invention, 
and these variations would become clear to those skilled in the art after perusal of this 
25 q)plication. 
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Claims 



1: An apparatus including a mass storage device, said mass storage 
device having a plurality of sectors, said apparatus including 

a plurality of storage blocks, each said storage block including a plurality of said 

sectors; 

wherein each said storage block includes a data portion arid an error code portion; 
wherein said data portion is responsive to data for said data block; and 

wherein said error code portion is responsive to data for a plurahty of said 
sectors in each said storage block. 

2. An apparatus as in claim 1, wherein said mass storage device includes 
one or more hard disks. 

3. An apparatus as in claim 1 , wh^in said mass storage device includes 
a RAID storage device. 

4. An apparatus as in claim 3, wherein said RAID storage device is a 
RAID level 4 device. 

5. An apparatus as in claim 1, wherein said error code portion is 
appended to said data portion. 

6. An apparatus as in claim 1, wherein said error code portion includes a 
checksum of the said data for said data block. 

7. An apparatus as in claim 6, wherein said checksum of said data for 
said data block includes 4-bytes of checksum data. 

8. An apparatus as in claim 6, wherein said checksum is a block- 
appended checksimi. 



9. An apparatus as in claim 8, wherein said block-appended checksum 
includes a checksum of said block-appended checksum. 

14 
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10. An apparatus as in claim 9, wherein said checksum of said block- 
appended checksum includes 4-bytes of data. 

5 1 1. An apparatus as in claim 1, wherein said mass storage device includes 

a cache or RAM. 

12. An apparatus as in claim 1, wherein said mass storage device includes 
one or more hard disks formatted with 520-bytes per sector. 

10 

13. An apparatus as in claim 1, wherein said plurality of said sectors is 

eight sectors. 

14. An apparatus as in claim 1, wherein said error code portion includes 
1 5 64-bytes of error code data. 

■ • i 

15. An apparatus as in claim 1, wherein said data portion includes 4,096- 

bytes of data. 

20 16. An apparatus as in claim 1, wherein said sectors include 520-bytes of 

data storage. 

1 7. An apparatus as in claim 1 , wherdn said storage block includes 4, 1 60- 
bytes of data and error code storage space. 

25 

18. An apparatus for protecting a mass storage device from data storage 
errors, said mass storage device having a plurality of sectors, said apparatus including 

a plurality of storage blocks, each said storage block including a plurality of 

said sectors; 

30 wherein a &st subset of each said storage block is responsive to data for said 

storage block; 

^ wherein a second subset of each said storage blocks is responsive to error 

code information; and 
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wherein said error code information is responsive to data for a plurality of said 
sectors in each said storage block. 



19. An apparatus as in claim 18, wherein said mass storage device 
includes one or more hard disks. 

20. An apparatus as in claim 18, wherein said mass storage device 
includes a RAID storage systenL 

21. An apparatus as in claim 20, wherein said RAID storage system is a 
RAID level 4 system. 



22. An apparatus as in claim 18, wherein said second subset is appended 
to said first subset. 



23 .An apparatus as in claim 18, wherein said error code information includes 
a checksum of said data for said storage block. 

24. An apparatus as in claim 23, wherem^ said checksum of said data for 
said storage block includes 4-bytes of checksum data. 

25. An apparatus as in claim 23, wherein said checksum is a block- 
appended checksum. 

26. An apparatus as in claim 25, wherein said block-app^ded checksum 
includes a checksxmi of said block-appended checksum. 

27. An apparatus as in claim 26 wherein said checksum of said block- 
appended checksum includes 4-bytes of data. 

28. An apparatus as in claim 1 8 wherein said mass storage device includes 
a cache or RAM, 
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29. An apparatus as in claim 1 8 wherein said mass storage device includes 
one or more hard disks formatted with 520-bytes per sector. 

30. An apparatus as in claim 18, wherein said plurality of said sectors is 

5 eight sectors. 

31. An apparatus as in claim 18, wherein said second subset includes 64- 
bytes of error code data. 

10 32. An apparatus as in claim 18, wherein said first subset includes 4,096- 

bj^es of data. 

33. An apparatus as in claim 18, wherein said sectors include 520-b3tes of 

data storage. 

15 

34. An apparatus as in claim 18, wherein said first and second^^subsets 
together include 4, 1 60-bytes of data and error code storage space. 

35. A method for protecting data jfrom data storage errors in a mass 
20 storage system, said mass storage system having a plurality of sectors, said method including 

determining a plurality of storage blocks, each said storage block including a 
plurality of said sectors; 

dividing each said storage block into a first subset and a second subset; 
generating error code information responsive to data for a plurality of said 
25 sectors in each said storage block; 

wherein said first subset is re^onsive to data for said storage block; and 
wherein said second subset is responsive to error code information. 

36. A method as in claim 35, wherein said mass storage system includes 
30 one or more hard disks. 

37. A method as in claim 35, wherein said mass storage system includes a 
RAE) storage system. 
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38. A method as in claim 37, wherein said RAID storage system is a 
RAID level 4 system. 



39. A method as in daim 35, wherein said second subset is appended to 
5 said first subset. 



40. A method as in claim 35, wherein said error code information includes a 
checksum of the said data for said storage block. 

claim 40, wherein said checksum of said data for said 
storage block includes 4-bytes of checksum data. 

42. A method as in claim 40, wherein said checksum is a block-appended 

checksum. 

15 

43. A method as in claim 42, wherem said block-appended checksum 
mcludes a checksum of said block-appended checksum. 

44. A method as in claim 43, wherein said checksum of said block- 
20 appended checksum includes 4-bytes of data. 

45. A method as in claim 35, wherein said mass storage system includes a 
cache or RAM. 

"^^^ ^ "^^^"^ ^ ^ 35, wherein said mass storage syst«n includes 
one or more hard disks formatted with 520-bytes per sector. 

47. A method as in claim 35, wherein said plurality of said sectors is eight 

sectors. 



30 



48. A method as in claim 35, wherein said second subset includes 64- 
bytes of error code data. 
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49. A method as in claim 35, wherein said first subset includes 4,096- 

bytes of data. 

50. A method as in claim 35, wherein said sectors include 520-bytes of 

data storage. 

51. A method as in claim 35, wherein said first and second subsets 
together include 4, 1 60-bytes of data and error code storage space. 

52. A method for efficiently detecting data errors in a mass storage 
system, said mass storage system having a plurality of storage blocks composed of a 
collection of sectors, including 

; reading data and error code information located m said storage blocks in a 
single operation; 

calculating run-time error code information for said data located in storage 
blocks; and ■ 

comparing said error code information with said run-time error code 

information. 

53. A method as in claim 52, wherein said mass storage system includes one or more 
hard disks. 

54. A method as in claim 52, wherein said mass storage system includes a 
RAID storage system. 

55. A method as in claim 52, wherein said RAID system is a RAID level 

4 system. 

56. A method as in claim 52, wherein said error code information is 
appended to said reading data. 

57. A method as in claim 52, wherein said error code information includes 
a checksum of the said reading dateu 
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58. A method as in claim 57, whorein said checksum of said reading data 
includes 4-bj^es of checksum data. 



59. A method as in claim 58, wherein said checksum is a block-appended 

checksum. 



60. A method as in claim 59, wherein said block-appended checksum 
includes a checksum of said block-appended checksvun. 

61. A method as in claim 60, wherein said checksum of said block- 
appended checksum includes 4-bytes of data. 

62. A method as in claim 52, wherem said mass storage system includes a 
cache or RAM. 

63. A method as in claim 52, vrheran said mass storage system includes 
one or more hard disks formatted with 520-bytes p« sector, 

64. A method as in claim 52, wherein said coUection of sectors is eight 

sectors. 



65. A method as in claim 52, wherein said error code information includes 
64-bytes of error code data. 

66. A method as in claim 52, wherein said reading data includes 4,(»6- 
bytesofdata. - 

67. A method as in claim 52, wharein said sectors include 520-bytes of 

data storage. 

68. A method as in claim 52, wherein said reading data and error code 
information together includes 4,160-bytes of data and error code storage space. 



20 



WO 03/009286 PCT/USOl/51321 

69. A method as in claim 52, including deteraiining whether said run-time 
error code information and said error code information in said storage blocks are equivalent. 

70. A method as in claim 52, including alerting said mass storage system 
5 if said run-time error code infomiation and said error code infomiation in said storage blocks 

are not equivalent. 

71 . A method as in claim 52, including retrieving said reading data if said 
run-time error code information and said error code information in said storage blocks are 

10 equivalent. 
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